Assigning Polarity Automatically to the Synsets of a Wordnet-like Resource
Abstract
This article describes work towards the automatic creation of a conceptual polarity lexicon for Portuguese. For this purpose, we take advantage of a polarity lexicon based on single lemmas to assign polarities to the synsets of a wordnet-like resource. We assume that each synset has the polarity of the majority of its lemmas, as given by the initial lexicon. Polarity is then propagated to other synsets through different types of semantic relations. The relation types used were selected after manual evaluation. The main result of this work is a lexicon with more than 10,000 synsets with an assigned polarity, with an accuracy of 70% or 79%, depending on the human evaluator. For Portuguese, this is the first synset-based polarity lexicon we are aware of. In addition to this contribution, the presented approach can be applied to create similar resources for other languages.
1998 ACM Subject Classification I.2.7 Natural Language Processing
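The two steps named in the abstract, majority voting over lemma polarities followed by propagation through semantic relations, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The integer polarity encoding (1, -1, 0), the tie rule, the relation names `hypernym-of` and `antonym-of`, the choice to invert polarity across antonymy, and all toy data are assumptions made for illustration only.

```python
from collections import Counter, deque


def majority_polarity(lemmas, lemma_lexicon):
    """Return the polarity held by most of the synset's lemmas, or None."""
    votes = Counter(lemma_lexicon[l] for l in lemmas if l in lemma_lexicon)
    if not votes:
        return None  # no lemma of this synset is in the initial lexicon
    ranked = votes.most_common(2)
    # Assumption: a tie between the two most frequent polarities
    # leaves the synset unlabeled.
    if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]


def assign_and_propagate(synsets, relations, lemma_lexicon,
                         keep=frozenset({"hypernym-of"}),
                         invert=frozenset({"antonym-of"})):
    """Label synsets by majority vote, then spread labels along relations.

    synsets:   {synset_id: [lemma, ...]}
    relations: {source_id: [(relation_type, target_id), ...]}
    keep / invert: hypothetical relation types that copy or flip polarity.
    """
    polarity = {}
    for sid, lemmas in synsets.items():
        p = majority_polarity(lemmas, lemma_lexicon)
        if p is not None:
            polarity[sid] = p

    # Breadth-first propagation; synsets that already have a label keep it.
    queue = deque(polarity)
    while queue:
        src = queue.popleft()
        for rel, dst in relations.get(src, ()):
            if dst in polarity:
                continue
            if rel in keep:
                polarity[dst] = polarity[src]
            elif rel in invert:
                polarity[dst] = -polarity[src]
            else:
                continue  # relation type not selected for propagation
            queue.append(dst)
    return polarity


# Toy data (hypothetical, not taken from the resource described above).
lexicon = {"bom": 1, "ótimo": 1, "mau": -1}
synsets = {
    "s1": ["bom", "ótimo"],
    "s2": ["mau"],
    "s3": ["coisa"],
    "s4": ["qualidade"],
}
relations = {
    "s1": [("antonym-of", "s3")],
    "s2": [("hypernym-of", "s4")],
}
result = assign_and_propagate(synsets, relations, lexicon)
```

In this sketch the sets passed as `keep` and `invert` stand in for the relation types that, according to the abstract, were selected after manual evaluation; which types those are is not stated here.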
Similar resources
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of the English language, in which nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
Q-WordNet: Extracting Polarity from WordNet Senses
This paper presents Q-WordNet, a lexical resource consisting of WordNet senses automatically annotated by positive and negative polarity. Polarity classification amounts to deciding whether a text (sense, sentence, etc.) may be associated with positive or negative connotations. Polarity classification is becoming important for applications such as Opinion Mining and Sentiment Analysis, which facili...
Automatically constructing Wordnet Synsets
Manually constructing a Wordnet is a difficult task, needing years of experts’ time. As a first step to automatically construct full Wordnets, we propose approaches to generate Wordnet synsets for languages both resource-rich and resource-poor, using publicly available Wordnets, a machine translator and/or a single bilingual dictionary. Our algorithms translate synsets of existing Wordnets to a...
Context-sensitive Sentiment Propagation in WordNet
Current state of emotive annotation of plWordNet:
• more than 83k annotations covering more than 54k lexical units and 41k synsets
• 22k polarity annotations different than neutral (13k of lexical units and 9k synsets)
• 1.5k synsets with different polarity across their units
  – without neutral units, only 345 of synsets with varying polarity strength
  – without neutral and ambiguous annotations, on...
Enriching a Portuguese WordNet using Synonyms from a Monolingual Dictionary
In this article we present an exploratory approach to enrich a WordNet-like lexical ontology with the synonyms present in a standard monolingual Portuguese dictionary. The dictionary was converted from PDF into XML and senses were automatically identified and annotated. This allowed us to extract them, independently of definitions, and to create sets of synonyms (synsets). These synsets were th...